Process Book

Analysis of US Natural Disasters

Liam Shalon | liamshalon@wustl.edu | 457508

Ben Siderowf | bsiderowf@wustl.edu | 465346

https://csex57.github.io/rtnaturaldisasters/ 

Introduction

With more and more news stories about natural disasters across the United States, it can be difficult to know whether a particular event is truly significant or whether it is being exaggerated for clicks. Our goal is to present a succinct, vivid, and truthful visualization of natural disasters from recent years, along with the most recent real-time data, so that users can draw informed conclusions about current-day disasters and see how they compare to similar events of the past. We hope this analysis leads to a more accurate view of the world, promoting better decision making and policies to address these crises in the future.

We would like to learn:

Related Work

The USGS provides a map of real-time earthquakes.

The Drought Monitor provides a map of current drought conditions across the US.

Questions

One major question we faced in creating our visualization was how best to convey many thousands of data points to the user. Our dataset has roughly 1,000,000 data points in total. We obviously can't show all of these at once: not only would it be impossible to comprehend, but no reasonable web browser could handle that many SVG elements. We considered several methods for condensing the information shown; our final solution was to select the 1,000 points with the largest magnitude within the user's selected time range and draw those on the map.
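The top-1,000 selection described above can be sketched as a small helper. This is a minimal illustration, not our actual code; the field names (`time`, `magnitude`) are assumptions about the record shape.

```javascript
// Select the n largest-magnitude points within a time range.
// Field names `time` and `magnitude` are illustrative assumptions.
function topNByMagnitude(points, startTime, endTime, n = 1000) {
  return points
    .filter((p) => p.time >= startTime && p.time <= endTime)
    .sort((a, b) => b.magnitude - a.magnitude) // largest first
    .slice(0, n);
}
```

Because `filter` returns a new array, the sort does not mutate the original dataset, so the same data can be re-filtered as the user scrubs.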

Data Sources

Data Processing

Exploratory Data Analysis

One of our initial concerns was the size of the datasets we would be loading. For example, initial research showed that one month of earthquake data could take as much as 10 MB, which posed a problem for our goal of showing several years of historical data in our visualization. However, through several data filtering techniques and by storing the data in a more compact format, we were able to reduce the total data size to a manageable level.
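One compaction approach is to round coordinates and store each record as a flat array rather than a keyed object, which shrinks the serialized JSON considerably. This is a hypothetical sketch, not our actual pipeline; the field names and precision are assumptions.

```javascript
// Compact verbose records into flat [lon, lat, mag, time] rows.
// Two decimal places of lat/lon is roughly 1 km of precision,
// which is plenty for a country-scale map.
function compact(records) {
  return records.map((r) => [
    Math.round(r.longitude * 100) / 100,
    Math.round(r.latitude * 100) / 100,
    r.magnitude,
    r.time, // epoch milliseconds
  ]);
}
```

Dropping the repeated key names alone saves tens of bytes per record, which adds up quickly over hundreds of thousands of rows.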

Even so, we still ran into problems due to the sheer size of the dataset: rendering tens of thousands of circles on the map was slow. This informed our design, because it meant that if we wanted to allow scrubbing, we had to draw only a subset of the circles rather than all of them. Hence, we ordered the data and picked the 1,000 most significant data points.

Design process

  1. Brainstorm: The following two pictures give a general idea of what we planned to apply to the final visualization. There were five alternative designs, and we picked three to incorporate into the final visualization.

General idea:

Filtered & Categorized:

Combine & Refine:

Question: We want to present the relationship between geographic area and natural disasters to help users recognize the risks in their own area and prepare for potential disasters in risky months. The problem is that some regions may contain multiple natural disasters, so overlapping colors become too blurry to distinguish the disaster types.

  2. Initial Design:

We have three charts that we want to include in the final visualization, filtered down from the charts in our proposal: a geographic map, a pie chart, and a dynamic line chart.

  3. Focus/Zoom: Different colors represent different kinds of natural disasters.
  4. Operation:

Explanation: Brushed area. As mentioned in the layout section, we apply a brush feature to the dynamic line chart to provide details on demand.
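The core of the brush interaction is mapping the brushed pixel extent back to a date range. The sketch below uses a minimal stand-in for a time scale's `invert` (as in D3's `scaleTime().invert`); the domain and range values are illustrative assumptions.

```javascript
// Minimal linear time scale with only the invert() we need:
// maps a pixel position back to a Date.
function makeTimeScale(domain, range) {
  const [d0, d1] = domain.map((d) => d.getTime());
  const [r0, r1] = range;
  return {
    invert: (px) => new Date(d0 + ((px - r0) / (r1 - r0)) * (d1 - d0)),
  };
}

// Convert a brush selection (pixel extent) into [startDate, endDate].
function brushToDates(selection, scale) {
  const [x0, x1] = selection;
  return [scale.invert(x0), scale.invert(x1)];
}
```

The resulting date pair can then drive the same filtering used for the map circles.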

  5. Discussion: Delivering both detailed and general messages is the advantage of this technique; the disadvantage is that it takes time to build.
  6. Detail: New columns need to be added to the dataset containing dummy codes for the different kinds of natural disasters, which can then be summed. Our team might also use dummy codes to represent regions if the final map visualization is scattered rather than concentrated. The dynamic feature and map may take 3-5 days to build, depending on how busy team members are in the following weeks. The software and libraries are available, since we did almost the same things in previous weeks. The math involved is conditional (if) and count functions.

Final Design

Our must-have features:

Optional features:

Reflection: our final version was very close to what we had originally proposed. We are very happy about how it turned out, and we feel that the overall look and usability of the product are on par with what we expected and desired.

Implementation

Over the course of our implementation, we changed our goals slightly. Our original vision for this project was to focus mainly on a real-time visualization of conditions across the US, but we ultimately decided to focus on creating a more robust visualization of historic data (across a 5-year period) instead.

 

Tooltips were implemented by attaching a callback to each SVG element with its associated metadata. On mouse hover, we relocate a rectangle to the cursor position, set its opacity to 100%, and update the text within it to match the element the user is hovering over.
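The relocation step can be sketched as a small positioning function: place the tooltip near the cursor, but clamped inside the chart bounds so it never overflows the SVG. The sizes and offset here are illustrative assumptions, not values from our code.

```javascript
// Position a tooltip box near the cursor, clamped to the chart area.
// All dimensions are in pixels.
function tooltipPosition(mouseX, mouseY, tipW, tipH, chartW, chartH) {
  const offset = 10; // gap between cursor and tooltip
  const x = Math.min(mouseX + offset, chartW - tipW);
  const y = Math.min(mouseY + offset, chartH - tipH);
  return { x: Math.max(0, x), y: Math.max(0, y) };
}
```

The returned coordinates would feed directly into the rectangle's `x`/`y` attributes in the hover callback.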

The map itself is implemented as an SVG element, with panning and zooming provided by D3's zoom behavior.
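Under the hood, a D3-style zoom transform maps data coordinates to screen coordinates as screen = data × k + translation. A minimal sketch of that arithmetic (the transform values are illustrative, not from our code):

```javascript
// Apply a {k, x, y} zoom transform to a [px, py] point,
// mirroring how a pan/zoom transform repositions map marks.
function applyTransform(transform, point) {
  const { k, x, y } = transform; // scale factor and translation
  return [point[0] * k + x, point[1] * k + y];
}
```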

The histogram was built by adding multiple rectangles to a view and layering scrubbing on top. Each data type on the map had a filter function that selected records between two dates; the filtered data was then fed into the map projection module, where functions converted those points into dots or regions on the map.
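The histogram bars amount to binning events by time period. A hedged sketch of per-month binning (the `time` field, in epoch milliseconds, is an assumption about the record shape):

```javascript
// Count events per calendar month, keyed as "YYYY-MM".
function binByMonth(records) {
  const counts = new Map();
  for (const r of records) {
    const d = new Date(r.time);
    const month = String(d.getUTCMonth() + 1).padStart(2, "0");
    const key = `${d.getUTCFullYear()}-${month}`;
    counts.set(key, (counts.get(key) || 0) + 1);
  }
  return counts;
}
```

Each map entry then becomes one rectangle, with the count driving the bar height.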

The real-time data button worked similarly: it simply swapped the underlying data for whatever was pulled in real time.
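For earthquakes, the real-time source is the USGS GeoJSON feed, where each feature carries `properties.mag`, `properties.time`, and `geometry.coordinates` as `[lon, lat, depth]`. A sketch of converting one feed feature into a flat point (the target field names are assumptions about our internal format):

```javascript
// Convert one USGS earthquake GeoJSON feature into a flat point.
// Feed layout per the USGS feed docs: coordinates are [lon, lat, depth],
// magnitude is properties.mag, time is epoch milliseconds.
function parseQuakeFeature(feature) {
  const [longitude, latitude] = feature.geometry.coordinates;
  return {
    longitude,
    latitude,
    magnitude: feature.properties.mag,
    time: feature.properties.time,
  };
}
```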

Evaluation

All in all, we feel that our project met expectations: it shows a temporal view of natural disasters while also offering the opportunity to analyze specific time periods in depth. We learned a lot about managing large data sources and handling various latencies. One big way we could improve the design would be to show an animation of how natural disasters have risen and subsided over time, and to allow users to toggle that view on and off.